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Abstract 



We consider the mean-field classical Heisenberg model and obtain detailed informa- 
tion about the magnetization by studying the model on a complete graph and sending 
the number of vertices to infinity. In particular, we obtain Cramer- and Sanov-type 
large deviations principles for the magnetization and the empirical spin distribution 
and demonstrate a second-order phase transition in the Gibbs measures. We also study 
the asymptotics of the magnetization throughout the phase transition using Stein's 
method, proving central limit theorems in the sub- and supercritical phases and a 
| nonnormal limit theorem at the critical temperature. 

^ , 1 Introduction and summary of results 

c5 ■ For many models of statistical mechanics, understanding their physical behavior starts with 
understanding the behavior of the corresponding mean-field model-which not only suggests 
how the physical model behaves, but also can predict rather precisely the physical behavior 
in high dimensions. There are two main statistical mechanical models of ferromagnetism: 
the simpler and better-understood Ising model, and the more realistic and more challenging 
(classical) Heisenberg model, on a lattice of dimension d with a spin a, £ § 2 at each lattice 
site i, the spin configuration a £ (S 2 )™ having Hamiltonian energy in the absence of an 
external field (anisotropy): 

hi 
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The nearest-neighbor Heisenberg model has constant interaction Jjj = J for nearest neigh- 
bors % and j, and no interaction Jjj = otherwise. The mean-field version of the Heisenberg 
model has an averaged interaction Jjj = for all i,j and can be understood as sending 
the dimension d — > oo or considering the lattice to be a complete graph on n vertices and 
sending n — > oo. 

The related quantum Heisenberg model has nearest-neighbor interactions of spin matri- 
ces, and in dimensions three and higher, there is a celebrated proof of a phase transition 
to long-range order [TJ. Such a result has been thus far intractable for the nearest-neighbor 
classical Heisenberg model, but the mean-field version is more amenable to rigorous analysis. 
In this article, we prove the existence of a second-order phase transition for the mean-field 
classical Heisenberg model, deriving a number of precise formulas and asymptotics for various 
physical quantities. 

The simpler Ising model of ferromagnetism (the spins are in {+1,-1}) is better under- 
stood, and our results parallel some recent developments for the mean-field version of the 
Ising model, called the Curie- Weiss model. It is believed that the Curie- Weiss model accu- 
rately describes the Ising model in dimensions greater than four, in the sense that they have 
the same critical exponents of various physical quantities (e.g., magnetization, free energy). 
The magnetization (the sum of the spins, appropriately normalized) in the Curie- Weiss model 
was shown by Ellis and Newman [10] to have a Gaussian law in the non-critical regimes and 
law that converges to the distribution with density proportional to e~ x I 12 at the critical 
temperature. Recently, it was shown by Chatterjee and Shao [3] that the magnetization at 
the critical temperature satisfies a Berry-Esseen type error bound of order 1/y/n for this 
non-central limit theorem. See also [2j [8] for analogous results on the Curie- Weiss-Potts 
model with an arbitrary finite number of discrete spins, and [1] for other related models. 

There are a few results known for the Heisenberg model. In dimensions one and two, 
the classical Heisenberg model with nearest-neighbor interactions has no symmetry breaking, 
i.e., there is no phase transition to asymmetric macrostates above a critical temperature, e.g., 
[6]. By contrast, the classical Heisenberg model with long-range interactions or anisotropy 
(an external magnetic field) has a phase transition in two dimensions and higher, see [13] 
and references therein. And for classical, isotropic Heisenberg models with nearest-neighbor 
interactions in three dimensions and higher, the existence of a phase transition was shown 
using Fourier-based infrared bounds [TT] . 

As for studying the mean-field classical Heisenberg model, the large- dimensional (d — > oo) 
limit of the nearest-neighbor model on Z d , with spins in S 2 , has the critical inverse temper- 
ature f} c — 3. Moreover, the large-dimensional limit is known to be a good approximation 
for high- dimensional models (as in Landau's theory of second order phase transitions) in the 
sense that below the critical temperature, the magnetization is zero for all d, and above the 
critical temperature, the magnetization has the correct (non-zero) limit as d — > oo. 

The results in this article for the isotropic classical mean-field (in the complete graph 
sense) Heisenberg model include: 

• Large deviation principles (LDPs) for the magnetization and the empirical spin distri- 
bution for each inverse temperature (3 £ [0, oo) with explicit rate functions and relative 
entropies. (Section 2) 
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• Explicit formulas for the free energy and descriptions of the canonical macrostates and 
the corresponding second-order phase transition. (Section 2) 

• An LDP for the empirical spin distribution with respect to the microcanonical ensem- 
ble, which has fixed energy, descriptions of the microcanonical macrostates and their 
second-order transition. (Section 2) 

• A central limit theorem in the subcritical phase for the magnetization with the usual 
CLT scaling y/ri, using Stein's method. (Section 3) 

• A CLT in the supercritical phase for the magnetization with a more complicated scaling, 
again using Stein's method. (Section 4) 

• A nonnormal limit theorem for the magnetization at the critical temperature, with 
limiting density of the squared length proportional to t 5 e~ 3ct (Section 5), using a new 
abstract Stein's method result for the nonnormal approximation (Appendix). 



2 The model and large deviations results 

We consider the isotropic mean-field classical Heisenberg model on a finite complete graph 
G n with n vertices. That is, at each site of the graph is a spin living in Q = §> 2 , so the state 
space is Q n = (S 2 )™ with P n the ra-fold product of the uniform probability measure on S 2 . 
For this model, the mean- field Hamiltonian energy H n : Q n — > R is: 



1 n 



2n 



The energy per particle is h n (a) = ^H n (a), and the canonical ensemble, or Gibbs measure, 
is the probability measure P n $ on Q n with density (with respect to P n ): 



f(a) := -exp ( — ^ a o) 

l<i<j<n 



}_ e -f3H n (a) = }_ e ~nl3h n (a) _ 



Here the partition function is Z = Z n ((3) = f n exp (j^ Y^i,j=i ( a «> dP n . 

The empirical measure = /i„ i(T of the spins {cxj} is defined to be the random measure 
/V<r := - Y^i=i dot on An interesting physical quantity is the empirical magnetization, 
defined by 

/n 
i=l 

A question of significant interest is the behavior of the magnetization as a function of the 
inverse temperature (3 in the Gibbs measures, so we begin by stating some large deviations 
principles for the fi n>a - The first is an LDP (large deviations principle) for /i nifT in the case 
that = 0, which is an immediate consequence of Sanov's theorem (see; e.g. [5]). 
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Proposition 1. Let {<jj}" =1 be i.i.d. uniform random variables on§ 2 and let ji n ^ := - Y17=i <W 
For a probability measure v on §> 2 , define the relative entropy of v with respect to the uniform 
probability measure \i by 

H{v | fi) := ff g2 /log(/)rf/i f:= t 6XZStS] 
loo otherwise. 

Let Mi(Ei 2 ) denote the probability measures on § 2 with the weak-* topology. Then for T a 
Borel subset of M^S 2 ), 

1 1 

— inf H{y | /i) < lim inf — log P n [fi n a G T] < lim sup — log P n [fi n a G T] < — inf H{y | //); 

v&° n->oo n ' „_>oo n uer 

that is, the random measures /i njCr satisfy an LDP with rate function H(- | /i). 

Remark 2. As a corollary of the Sanov-type LDP above, or independently, one can prove 
a Cramer-type LDP for the magnetization M n := -X^Li ^ itself. Specifically, for the i.i.d. 
uniform random points {<7j}™ =1 on S 2 C R 3 , the magnetization laws [i n := £ (M n ) satisfy an 
LDP with rate function I : 

P n (M n ~x)~ e - nI( > x \ 
where I(x) = c\x\ - log (^jM. j = cco th(c) - 1 + log (-^y) ■ 

An LDP for the magnetization with respect to the Gibbs measures can also be proved, and 
we do this in the general case by deducing from Proposition [1] an LDP for the empirical spins 
/i n , jCr with respect to the Gibbs measures for (3 > 0, making use of equivalence of ensembles 
(see [9]). The hidden (Polish) space is Mi(S 2 ), the set of Borel probability measures on 
§ 2 equipped with the weak-* topology. The hidden process is {/i n ,o-}5£Li as in the theorem 
above, which satisfies an LDP with rate function H(- | /i). The representation in question 
here is of the energy per particle, rather than the Hamiltonian itself: 



h n (a) = ^ (a u <Tj) = -^(^j ' 2 xdfi ntCT (x), j ^ xdfi Uta (x)^ = ~ j ^ xd/i ntCT (x 

we define h : M^S 2 ) — > R by h{v) := — | |J g2 xdu(x)\ 2 . Note that the expression inside the 
norm is simply the center of mass of the measure v. 

We immediately obtain an LDP for the empirical measure of the spins at positive tem- 
perature by applying the equivalence of ensembles argument [9] to 

zM := [ e~^dP n and Pn>f) {B) : = — ^- / e~^dP n . 
Jn n Zn{P) Jb 

Proposition 3. With notation as above, the fi n ^ satisfy an LDP with respect to the Gibbs 
measures P n ^ := p n ^ on Afi(S 2 ) with rate function 



xdu(x] 
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where the free energy <p(/3) 
given by 



i logz n (n/3) 

HO" I *0 - £ 



lim^oo - log Z n (j3) exists and is 



inf 

^eMi(§ 2 ) 



xdv[x) 



(2) 



Moreover: 



(a) The set of canonical macrostates 8,p 
o/M^S 2 ). 



{v : /g(i / ) = 0} is a non-empty, compact subset 



(b) Non-macro states are exponentially unlikely. 

(c) For every (3 > 0, every subsequence of P n ,p{n>n,a £ •} has a subsubsequence converging 
weakly to lip, a probability measure on Mi(§ 2 ) concentrated on 8,p, i.e., Hp(8,g) = 0. In 
the case that Ep = {u} for one v £ M^S 2 ), then the whole sequence converges weakly 
to by. 

To identify the measures in Ep, first observe that if / = ^ then H [y \ \i) depends only 
on the value distribution of /; that is, (roughly speaking) once the values that / takes on 
the frequency with which they are taken on are fixed, the first term is determined. It is 
then easy to see that the expression | J xdv(x)\ is maximized (for a given value distribution) 
for corresponding densities which are symmetric about a fixed pole and decreasing as the 
distance from the pole increases. Consider, then, the case that / = a density that 
is symmetric about the north pole and decreasing away from the pole. That is, v g is the 
measure with density f(x,y,z) = g(z) which is increasing in z. Then 

0(cos(0)) log[0(cos(0))] sm(9)d6dip 



H(u g | n) 



By the same substitution, 



1 

Air 
1 
2 



g(x)\og[g(x)}dx. 



vdv g (v) 



1 xg(x) 



dx. 



The problem is thus to minimize 



1 P 
g(x) log[g(x)]dx - - 

i ^ 



xg{x 



dx 



for g : [—1,1] — > such that | g(x)dx = 1 and g is increasing. Observe that 



g{x)\og[g{x)}dx 



g{x) log 



9{x) 



dx + log(2) = -h (|) + log(2) 



where h(ip) is the (usual) entropy of the density <p. 

Now fix the value of \J xdu(x)\ £ [0,1] and minimize | J_ 1 g{x) log[g(x)]dx over the 
v £ Mi(S 2 ) corresponding to this value; this is a constrained entropy maximization problem, 
for which known results [1] imply: 
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Proposition 4. Consider the class of f : [—1, 1] — > M + such that 

• f-i f(x)dx = 1, and 

• J_. xf(x)dx = c. 

Then f*{x) = k\e k2X uniquely maximizes h(f) over the densities satisfying these conditions. 

Now, to determine k±, k 2 (their formulas are in the next proposition), observe that for /* 
to satisfy the first condition, 

1 k^dx = 2fclSi , nh(fc2) , 
i k 2 



and thus 



For the second condition, 



ko 



ki I xe k2X dx = k\ 



2 sinh(/c 2 ) 
2cosh(A;2) 2sinh(/c 2 ) 



ko 



k\ 



coth(/c 2 ) — 



Take g* = 2f*; considering all c G [0, 1] and requiring g* to be increasing corresponds to 
considering all k 2 G [0, oo). In that case, we need to minimize 
• i 



g*(x) log[g*(x)]dx - ^c 2 



log 



ko 



+ k 2 coth(/c 2 ) 



I 
2 



coth(/c 2 ) — 



(3) 



sinh(/c 2 ) 
over all k 2 G [0, 00). 

This optimization problem leads to a critical value of the inverse temperature (3 C = 3, 
and the phase transition is a continuous one (2nd order in physics parlance). Below the 
transition, the only macrocanonical state is the uniform distribution, and then increasing f3 
across the critical threshold, a spherically symmetric family of distributions with a preferred 
direction appears. At first the direction is hardly preferred at all, but with increasing /3 the 
preferred direction becomes more strongly preferred, so that in the zero temperature limit 
— > 00, the macrostates are point masses. 

More precisely, the following results are proved in the appendix and summarized here: 

Proposition 5. The free energy tp as characterized in Theorem^ has the formula (lemma 



<P(P) 



0, if$<Z, 



where is defined above in 



and 



g[x) 



X 



(3. 



coth x — 1/x 

Second- order phase transition: The function tp and its derivative tp 1 are continuous at 
the critical threshold (3 = 3, so the phase transition is continuous. 
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(a) In the subcritical case, (3 < 3, the expression fl3]) is minimized for ki = 0, and the 
corresponding k\ = 0, so that the minimizing function f* = l and hence the canonical 
macrostates in the subcritical case are uniform: 8.p = {yu}. 

(b) In the supercritical case, (3 > 3, the minimizing k 2 has limit lirrig^ k 2 = 0. 

The macro states £p = {^} Ig §2, where v x is the rotation of the measure v considered 
above, such that x plays the role of the north pole. 

One can furthermore use the computations above (or work directly) to derive an explicit 
formula for the rate function for the LDP of the magnetization - Y17=i Gi - Specifically, one 
has the following. 

Theorem 6. Let {cr}" =1 be a configuration of spins distributed according to the Gibbs measure 
described above, and let S n := Y^7=i- Then for a Borel set fel, 



— inf Ip(x) < lim inf — log P„ 

where 



P\S n 



n 



ccoth(c) 



e r 



log 



< lim sup — log P n>/ 3 



P\s n \ 



e r 



n 



< — inf Ip(x 



sinh(c) 



2 



coth(c) 



and c is the unique element o/M + such that \x\ = coth(c) 



Also of interest is the microcanonical ensemble, in which one fixes the energy per particle, 
and the following result gives an LDP in that case using the results of [9]. 

Proposition 7. Since the empirical measure of the spins yL n>a = -^25 ai satisfies an LDP 
with respect to the product measure P n with rate H(- | fi), we have the following: 

(a) The energies h{n n>a ) and H n satisfy LDPs with respect to P n with rate J, called the 
microcanonical entropy, defined for a fixed value u of the energy by: 

J{u) := mi{H(v \n)'.ue M^S 2 ), h(u) = u). 

The free energy if is a Legendre-Fenchel transform: tp(/3) = inf U £r{(3u + J(u)}. 

(b) If u G dom(J), then /i n , jCr satisfies an LDP with respect to the microcanonical Gibbs 
measure 



pu,r {A) 



J u J A 



r]}dP n , 



where Z u := L 2 ^{H„e[u+r,u-r]}dP n> and with microcanonical rate function 



H{v | fi) - J(u), if -\ (is, is) 



u: 



oo, 



otherwise. 



That is, 



inf I u {v) < limliminf-logP^' r (r) < limlimsup - logP n u ' r (r) < - inf I u {u); 



r— ¥0 n— >oo fi 



r->0 



n 
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(c) The microcanonical macrostates are 



:= {v : H{y | [i) = J(u), h{v) = u}. 

Again, it suffices to restrict our attention to symmetric densities symmetric about a 
pole with unit vector z, i.e., J vdv{v) = cz, because symmetrizing about the z-axis reduces 
relative entropy: Recall that djj, = djj, z ^^ where /i 2 is the uniform measure on the circle 
of radius \J\ — z 1 . Let f(z) := f f(x,y,z)dfi z (x,y), which is the symmetrized version of / 
about the 2-axis. Now, the function g(x) = xlog(x) is convex, so by Jensen's inequality, 



f(z)\og[f(z)] = g [ I f(x,y,z)d[M z (x,y) J < / g(f(x,y, z))dfi z (x,y) 

f(x, y, z) log[(/(x, y, z)}d{i z (x, y). 



Integrating both sides with respect to 4f shows that H{y \ jj) < H(u | /x), where u and v are 
respectively the measures with densities / and /. 
We can compute J to be 

If 1 V „ 1 



H(v\(i): ^- jxf{x)dxj =-2u, - j 
and then simplify it using the previous result on maximizing entropy, with k 2 solving coth k 2 - 



J(u) = log ( -r^— J + k 2 V^L. 
\smn k 2 J 



The microcanonical entropy is 

r(i/) = -/i(//2)+log2-J(u), if (^-Jxf{x)dx\ =-2u. 

The domain of J is (— 1,0], and the microcanonical macrostates Z u consist of rotations 
to any direction of v with density 

/ = — ^-e k2X , where coth/c 2 - — = y/ -2u. 
smh k 2 k 2 

In particular, £° = {fi}, the completely disordered phase, and for energies close to zero 
— \ <C u < 0, there is the expansion J(u) ~ — 3u — |u 2 + . . . , and k 2 — 3a/— 2m. Thus the 
microcanonical macrostates for small energy are £ M = {v x }xes 2 i where u x is the rotation of 
v to the x direction, again a continuous transition to the ordered phase. 
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3 The magnetization in the subcritical phase 



We restrict our attention to the subcritical (disordered) regime (3 < 3 in this section. 

First note that each of the spins Oi has a uniform marginal distribution, because the 
density of the Gibbs measure is rotationally invariant. In particular, E[<jj] = for each i, 
and thus the expected magnetization is zero. 

To compute the variance of the total magnetization S n := Yli=i a h note that ^ ( a i-> a i) = 1 
for each i\ moreover, by the symmetry of the Gibbs measure, E (cij, Oj) is the same for every 
pair i ^ j. Now, conditional on {o~j}j^i, the density of <i\ with respect to uniform measure 
on S 2 is given by 



^exp 



where the normalization is Z\ = J g2 exp ^ Ylj^i a j) dfi(9). For fixed i E {1, . . . , n}, let 

£jW ;= ^-/jCTj. Note that dfx(9) = ^j^dadcj), where (a,0) are spherical coordinates; Z\ is 
therefore given by 



rW ccosta) ■ , \, j, sinh(c) 
o Jo c 



where c 



Now, from the conditional density above, 



E [o-i|{^}jVi] 



9 exp 







n 



(7 



(1) 



47TZ! 

coth 



2tt pit 



a 



(1) 



Wl 



-<^^ (1) ) 

77, X ' 



exp 

/3 lo-^ 1 ) | cos(q) 



dn{0) 



cos a sin a e 



dadcj) 



o </o 



a 



(i) 



(i) 



n 



/i 



(3\a 



Ml 



a 



(i) 



(i) 



Recall that if /3 < 3, then 



- ~ f , and so we have that if /3 < 3, 



o(l) with probability exponentially close to one. Ex- 
panding about zero gives that for x small, coth(x) 
then 



3n 



(this argument will be made precise below). It follows by taking inner product of both sides 
with (72 followed by expectation that 

E(<t u <t 2 ) « L^h^a^aA = L [1 + („ _ 2 )E (a u a 2 )} , 
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and thus 

E {(7U a2) ~ 3n-/3(n-2) ~ ^3^S) " 

Finally, 

E|S n ,| 2 = nE|ai| 2 + n(n - 1)E (01, <x 2 ) 



3-/3 



We therefore consider the scaled magnetization M = y X^Li ^ ne remainder of 
the section is devoted to proving the following central limit theorem for M. 



Theorem 8. For (3 < 3, there is a constant cp depending only on (3 such that for M 

ITU- (\*\ W IVW S C P l °^ n ) 

sup \hg(M) — Kg{Z)\ < 



g:Mi(g),M2(s)<l 



where M\(g) is the Lipschitz constant of g, M 2 (g) is the maximum operator norm of the 
Hessian of g, and Z is a standard Gaussian random vector in M 3 . 

The theorem is proved as an application of the following abstract normal approximation 
theorem from Ua 



Theorem 9. Let (X, X') be an exchangeable pair of random vectors in M. d . Let 5F be a 
a-algebra with c(X) C J, and suppose that there is an invertible matrix A, a symmetric, 
positive definite matrix E, an 5 '-measureable random vector E and an J ' -measureable random 
matrix E' such that 

(a) 

E[X' -X\3] = -AX + E 

(b) 

E [{X' - X)(X' - X) T |3 r ] = 2AS + E' . 



Then for g G C 



\Eg(X) -Eg^ZM < M 1 (g)\\A- 1 



nop 

2T 



EI^I + illE-^llopEH^IlH., 



(4) 



21 



M 2 (^)||E^ 1 / 2 || op ||A- 1 || op E|X / -X| 3 , 



where Mi(g) is the Lipschitz constant of g and M 2 (g) is the maximum operator norm of 
Hess (g). 

The first step in applying the theorem is to construct an exchangeable pair (M, M'). 
As is frequently the case, the exchangeable pair will first be constructed on the level of 
configurations, and then descend to the magnetization. Given a fixed configuration er, 
construct a new configuration a' by letting / be distributed uniformly in {l,...,n} and 
replacing 07 by a'j, distributed according to the conditional distribution of the J-th spin, 
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given {a-,- : j ^ I}, and denning erj := aj for j ^ I. Then M = i/^^" =1 (Ti and 



M' = M(a') — M — J ^cr/ + y ^o"/! that is, M is the magnetization of the configuration 
in which a random spin has been replaced via the Gibbs sampler. For notational convenience, 
let a := -y/3 — (3. Then, using the computation above, 

n 

E [M' -M\a]= -JL; h " E Nfo}i*]] 



■i=i 



1 , , a \ 
-M+-—y 



coth 



Now, since /3 < 3, it is known that 



i=l 



/3|a 



/3|a«|J Vk W 



a 



(0 



o(l) with probability exponentially close to 



1. We therefore use the expansion of coth(:r) — - near zero to write 



E\M' -M\a] = M 

Tl 



EE/ 3 



3n 5/2 L^L^r°i 

1=1 j^i 



ri 



8=1 



coth 



n 



(Ol 



n (3\a 



(Ol 



/3|o-W| 3n 



(T 



(0 



r(0 



Note that 



i=l j^ti i=l N v / 



n V 3 3n 



M. 



i-£ 

The matrix A of Theorem [9] is thus — -Id and 



a ^-v 

^3/2 / ^ 
i=l 



3n 



coth 



/3|a 



(Ol 



n /3|cx 



(0| 



/3|a(0| 3n J V> W 



(0 



We next compute E [{M' - M)(M' - M) T \a] . By the same considerations as above, this 
is given by 



2 n i p 

E [(M' - M)(M' - M) T \a] = - 2 V ± / (0 - a,)(9 - *) T <«p 

n i=1 z « </s 2 



/3 



E 



d//(0) 



n 

El!r 



i=l 



cr^ T — 6*0"^ + cXjCrf ] exp 



n 



E 



(7 



.7- 



dn{9). 



Letting 9 = 9\ + 9 2 , where 9i is the projection of 9 onto the direction of a^> and #2 is the 
orthogonal complement, the first term of the last expression is given by 



1 



[etfl + 9 2 9 T 2 \ exp 



P 



dm, 



(5) 
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since the cross terms vanish by symmetry. To compute it, write n = j^pyy, so that 9\ 
(9,n)n, and 9 X 9\ = \ (9, n) \ 2 nrf ; setting c := 



9 1 9{exp 



H JS 2 



P 



n 



|(6>,ri)| 2 exp [c (n,9)]dfi(9) nr. 



(Recall that nrf is orthogonal projection onto the span of in IR 3 .) In spherical coordinates 
(with Ti playing the role of the north pole), the integral is then given by 



2-7T m 



|coso;| 2 exp [c cos a] sin adadcj). 



J t JO JO 

Evaluating and using the established formula for Zj yields 



1 



9 t 9fexp 



H J §2 



n 



y^9) 



d^{9) 



c 2 - 2ccoth(c) + 2 



Pi. 



where Pi is orthogonal projection onto r^. 

Now, for the second half of flH]), let (9 x ,9 y ) be a representation of #2 in orthonormal 
coordinates within r±. Note that 

9 x 9 y e c{n ' e) dfi{9) = 

by symmetry. Expanding in polar coordinates, 

9 2 y e c{n ' e) dn(9) = — / / sin(a) 3 cos(0) 2 e ccos{Q) dadcf) 

= ~jf sin(a) 3 e ccos(Q) c/a 
ccosh(c) — sinh(c) 



Of course, the value is the same when 9y is replaced by 9%, and thus 



- / 9 2 9i exp 



n 



= ccosh(c)-sinh(c)^ 



Zc 3 



coth(c) 



where P^ is the orthogonal projection onto rj~ . 

Formulae for the middle terms follow from the computations above: 



— / Oaf exp 



P -{^\9) 
n x ' 



d^{9) 



coth(c) 

c 



The next term is just the transpose of this one. 
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Finally, the last term is trivial: 

1 



— / a i a i exp 



n x 1 



dfi(8) = <jidl \ 



Collecting terms, 

2 n 

E [{M' - M)(M' - M) T \a\ = ^ E 



i=l 



2c coth (c) - 2 



P + 



coth(c) 1 



P 



coth(c) 



where c = ^ ^ - . Recall that if c = o(l) then coth(c) — \ ~ | . In this case 
E [(M' - M)(M' - M)>] = fL £ Up, + tp} 

i=l \- 



E" 



a 2 ^ a 2 
3n n 2 



(6) 



3n 2 



%=\ i=l 
where the error term E" incurred in this approximation is 

i2 — < r 2 2c coth (c) - 2 



P" 



n- 



t=i 



P 



coth(c) 1 1 
c c 2 3 



P^ 



coth(c) 



1 c 
c ~ 3 



Observe further that, for the second-last term of using = t^-t and c = ' yields 



a 2 c 
3n 2 



i=i 



i=l j^i 



i=l 



Putting the pieces together, 



E[(M' -M)(M' -Mf\a] 



1 n 

Id H — > 3(7^0", 
n 

(^)Ri> 



2£ 
3n 2 



2/3_ 
3n 4 



i=i 



<7; 



3 MM- + ^±a,,f + E». 



3n 2 



i=l 



Note that if X is uniformly distributed on the sphere, then if U G O3, 

E[II T ] = E[UXU T UX T U T ] = UE[XX T ]U T , 
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and thus E[XX T ] is a scalar matrix. Moreover, Tr (XX T ) = 1 since XX T is a rank- 
one projection, and thus E[Tr (XX T )] = 1 and so since E[JX T ] is scalar, it follows that 
E[XX T ] = \ld. That is, E[<T;<7f] = ^Id for each i. We therefore write 



E [(M' - M)(M' - M) T \a] = 2 



1 —L ) £ + 



n 



as in Theorem [91 with S = Id and 



E' 



ii 



- V Zuiu] 
n ^— ' 



Id 



i=l 



3n 2 



i=l 



Note in particular that the expected value of the first term of E' is zero. 

To apply Theorem [9J various quantities need to be computed or estimated. Recall that 



A 



-Id, and thus II A 1 | 



op 



Now, the first term of E is — ^M. Note that, while it 



was previously argued heuristically that E|M| 2 m 3, one can in fact use the same argument 
together with the fact that coth(x) — ~ < | (shown in the proof of Lemma [T4I) to show that 
E| M| 2 < 3. It then follows that 



^E|M| < A v ^wp < 



3n 



P 



\/3; 



To estimate the second half of E, fix e = e(n) £ (0, 1) to be chosen later. For notational 
convenience, let R(t) := coth(t) — j — |; observe that if t < e then |-R(t)| < fee 2 , where 6 is a 
universal constant. Then the second half of E can be estimated as 



a 

n 3/2 



E 

i=i 



wi 



a 



(0 



(7 



(0 



< 



a 

372 



E 

i=l 



(01 



a 



+ 



3/2 



E 



i=l 



R 



P\a® 



P\a 



(01 



(0 



(i) 



n 



> e 



making use of the fact that \R 



< 



6ae 2 



n 

a \ 

372 



/3|a 



(01 



> e 



n 



P 



> e 



< 1 for any configuration a. From the LDP for <jW, 
< C exp 



--infjTg^) : x > e} 



where 



ccoth(c) 



log 



sinh(c) 



P 
2 



coth(c) 



and c is the unique element of M + such that |x| = coth(c) — -. It is shown in the appendix 
that Ip{x) is increasing for < 3, and thus ini{Ip(x) : x > e} — Ip{e). Moreover, there is a 
universal constant r > such that for e £ (0, 1), Ip{e) > ^ (l — |) — re 3 . It follows that 



P 



PW 



01 



> e 



n 



< C exp 



T2" 



+ nre 
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Moving on to the error E', it is true in general for x6l" that ||££ T || J ff,s = \x\ 2 , and so 

E\\MM T \\ HS = E\M\ 2 < 3 



and 



E\\a i aT\\ HS = E\a i \ :e = l. 



Estimating E||±£" =1 (3a iaf — Id)\\ HS is a bit more involved. First, recall that 
^/Tr (AA T ), and so by the Cauchy-Schwarz inequality, 



E 



1 n 

n z — ' 

i=i 



HS 



1 

< - 

n 



\ 



ETr [{3a i( jf - Jd)(3a j( rJ - Id)] . 

»J=1 



Now, 



ETr [(Satof - Id) 2 ] = E [9Tr foal apf) - 6Tr (a i0 f) + Id] = 9E|a z | 4 - 6E|a; 
Similarly, for i ^ j, 

ETr [(3«7<fff - Id){3(Tjof - Id)] = 9E [(^, a^) 2 ] - 3. 

Observe that 

E [{<T 1 ,<T 2 ) 2 \{<?i}&l] = [^l^K^i}^!] °2 



3 = 6. 



— / ^ T exp 



n 



°2 



2ccoth(c) - 2 



dn{9) \ o 2 
ccoth(c) — 1 



where we have made use of the computation of expression (jHj) carried out earlier, and again 

c = ^ - , Pj denotes orthogonal projection onto the span of a^ l \ and P- 1 denotes orthogonal 
projection onto the orthogonal complement of the span of erW. Once again making use of 
the fact that c = o(l) with high probability, and so coth(c) — - 



3 45' 



E [{a u a 2 ) 2 |{<7 t } ¥1 ] « 4 ^Pi 



45 



1 c 2 c 2 



3 45 15 



+ — E [a 2 r a 1 ala 2 \{cri}i^i\ ■ 



Taking expectation of both sides yields 



1 



E [K02} 2 ] « - + E 



c 2 c 2 



45 + 15 (cTl ' Cr2) ' 



so that 



ETr [(30-^ - Id)(3a j aj - Id)] « E 



c 2 3c 2 



(0-1,0-2)' 



The error incurred in this approximation (for each pair % 7^ j) is 



E 



90 



T 



2 2ccoth(c)-2 2c 2 

3 c 2 45 



P + 



ccoth(c) — 1 1 c 2 
c 2 3 + 45 



< r x Ec 3 
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where t\ is a universal constant. 
Now, 



Ec 2 



IP 



ir 



/3 



rr 



2 r 



n - 1 + (n - l)(n - 2) 







ra(3 - /3) 



< 



3/3 s 



ra(3-/3)' 



and one can then trivially also estimate that 

Ec 3 < 
and so 



3 , 3/3 3 



n(3-/3)' 



E 



1 n 

i=l 



'6 + 



< 



(3-/3) 



n 



The remaining term of the error E' is E", for which the remainder from Taylor's theorem 
also suffices: 



e||£1^<^Ve 

i=l 



2 2ccoth(c)-2 
3 



+ 



HS 



coth(c) 1 1 
c c 2 3 



PS 



HS 



i / \ 1 c 

coth(c) 

c 3 



HS 



< 



< 



rt-5 Z— / 



i=l 



c lv /3(3-/3)/3 



77/ 



3/2 



for a universal constant c\, using the facts that H-PiH^rs, ||-P^ ||hs an d 1 1 1 "/ Whs are an 
bounded by y/2 or better and that E|<7 (i) | < 



The final error term from Theorem [9] is trivial: 

a 3 

E\M' - M\ = -^jE|c4 - ct/I < 



8a 3 



■/?.■ 



3/2 ' 



Using the estimates above in the statement of the theorem yields 

3ci 



\Eg(M)-Eg(Z)\<M 1 (g) 



6 + 



r-2(/9 2 +/3 3 ) 



(3-/3) 

An 



3b 



+ Mi(p)-==v^ 



e 2 + C exp 



, 7/3 


) (3 


-0)n 


Ue2 (l 
12 V 


-1) 



+ 



2/3 



+ M 2 (g) 



2?r(3 -/3) 
n 



for g e C 2 (IR 3 ). Finally choose e = e(n) such that e 



2 _ 121og(n) 



1(3 



We may now conclude that if (3 < 3, then 



snp |E,(M)-E,(Z)|< C ^° g(n) 



g:Mi(g),M 2 (s)<l V n 

for some constant depending only on j3. 



4 The magnetization in the supercritical phase 



In the ordered regime, where /3 is large, the spins tend to align. Indeed, it follows from the 

c 

2 



large deviations principle that \S n \ is close to with high probability. Let 



W :-- 



n 2 k\ 



(7) 



we show below that W satisfies a central limit theorem. Since the distribution of the magne- 
tization - Y^j=i a j * s rotationally invariant, this gives a complete picture of the asymptotic 
behavior of the magnetization. 

In order to obtain the central limit theorem, we apply the following version of Stein's 
abstract normal approximation theorem (see P2], pg. 35), essentially due to Rinott and 
Rotar ([15], Thm. 1.2). 

Theorem 10. Let h : R — >• K. be bounded with bounded derivative. Suppose that (W, W) is 
an exchangeable pair and let J be a a-field with respect to which W is measureable. Suppose 
further that there is A > and an J '-measurable random variable E such that 

E[W - W\3\ = -XW + E. 

Then if Z is a centered Gaussian random variable with variance a 2 , 



\Eh(W)-Eh(Z)\ < 



7T \\h\\^E\E\ 



A 



f2||/i|| 00 E 



2A 



E [{W - W) 2 ^] 



\h'\\«M\W' - W\ 



4A 



To apply the theorem to W as defined above, construct an exchangeable pair (W, W) 
using the Gibbs sampler as before; that is, define W by first replacing a randomly chosen 
spin in {aj} according to its conditional distribution given the rest of the spins. Then 



E [W- W|<j] 

/3 2 




coth 



(01 



n 



n 



P\a®\ 



(0| 



2 -w 

n 



coth 



i=l 



(0| 



n 



n 



/?|o-(0|_ 



(0| 
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For notational convenience, let g(x) := coth(x) — -. Then observe that 



ii 



P\S n \ 



11 



< y\ 



P\° {€) \ P\S n 



ii 



n 



< 



\g'\\oc/3 



n 



since S n which has length 1. It follows that 



2/3 2 



n 5 / 2 ko 



P\S n 



n 



(7 



< 



2||^||oo/3 2 



n 



3 / 2 k\ ' 



since \o^'\ < n. Next, observe that 



2/3 2 



5> 



i=i 



P\S„ 



n 



\o 



(*) I _ I Q 

On 



< 



3 / 2 fc 2 ' 



77/ 



and so 



E[W'-W\a] = --W--^=; 

1 J n \/n n 6 i /L k, 



2f3 2 fj3\S< 



2 9 
2 



n 



\S n \ + Ei, 



where E\Ei\ < 4^. 

We next approximate g (j^^- 
LDP for \S„\. We have that 



by a first-order Taylor polynomial, making use of the 



lim sup — P 7 



n 



n (3 



> e 



< - inf Ip(x), 



|M-tH>« 



(9) 



where 



Ip{x) = <S>fi(y)-ip(P), ®p(y) = y|x|+log 
and y is uniquely defined by 



sinh(y) 



0, , 



coth(|/) 



1 



Recall also that 



^2 



¥>(/?) = iuf$ j8 (y) = -± + log 

ir>0 zp 



sinh(/c 2 



It was moreover shown in the appendix that |x| = %■ corresponding to y = k<i is the unique 



minimizing set for and that <&p(y) is decreasing as a function of \x\ on 



^ 



and 



increasing on 



k2 



00 ) . This means in particular that 



inf /^(x) = min ^ / )? ( y — + e , Jg [y — - e 







/3 



where by Ip(y(t)) we mean the value that /^(y) takes on for all x with |x| = t and y defined 
in terms of I above. 
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Now, we know that is minimized on |x| = so that (/C2) = 0. It is shown in 

the appendix that $^ (k 2 ) > 0, so that there is a constant Kp such that 

inf Ip(x) > Kpe\ 



and so 





\Sn\ 


k 2 


> e 


< 




n 


(3 







-K R ne z 



Applying this estimate then yields 
2(3 2 \Jf3\S, 



E 



n 2, / 2 kl 



9 



n 



■ ^||/||oo6 2 E 



n^k 2 



9 (h) - g'(k 2 ) 



P\s„ 



-k, 



n 



n 



+ 



2/3 2 



n 3 / 2 k 2 



[aiMloo + H^llooGs + ^E 



1*5' 
< e 
\S n \¥ 



n 



> € 



< 



2(3 2 
\fnk\ 



Ce 2 + C'e- K e ne2 



where we have also used the trivial estimate \S n \ < n in the last line. Choosing e 2 = lo ^^ 
gives that 



E 



2/3 2 



n 3 / 2 k 2 



P\S n 



n 



-g(k 2 )-g'(k 2 ) 



P\s n 



n 



< 



log(n) 



n 



3/2 



for some constant Cp depending only on 0. 

Combining this estimate with (jSj) and recalling that g(k 2 ) = y gives that 



E \W - W\a\ = --W + -^= 
1 n \/n 



P\S n \ 

nko 



2f3 2 g\k 2 ) fm 



rfil 2 k\ 



n 



k 2 )\S n \+E 2 , (10) 



where E|.E 2 | < 



cp log(n) 
n3/2 ' 



Now, observe that 



\Sn\ 



nk 2 

T 



1 + 



W nko 



n (3 



where \E Z \ < This means that the second term of (jTUl) is 

W 2E 3 



n wn 



and E 



2E Z 
s/n 



< -§^E\W\ 2 < ^f^, using the LDP as above. Similarly, 



2/3V(fc 2 ) 
n 3 l 2 k 2 



E 



P\S n 



n 



- k 2 \S n 



k 2 W 
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and 



E 



W, C | VngW 



2/3 



. W(fc2) E 



4/3 



< 



C/3 log(n) 



ii 



3/2 



again using that E|IU| 2 < log(n). Combining these estimates with ( ITOjl yields 



(1 ~ 

n 



W + P 4 , 



(11) 



where again 

E|P 4 | < ^jr 1 - Recall that it was shown in the appendix that 0g'{k,2) 



< 1, and so the approximate linear regression condition in Stein's abstract 



k'2 sinh 2 (fc2) / 

normal approximation theorem holds with A = — ^sJJ^ml _ 

The next step is to compute E [(W — W) 2 \a~\ . From the definition, 



E[(^-^)>]=-^$> 



1 2 J2( a *i - ai > a i) 1 

2 2=1 j,fc^2 



4/3 4 



(12) 



Now, E [(<r* — — o"j) T |cr, 7 = i] was already computed exactly in the < 3 case, and 

was found to be 



E [(a* - (Ti)(a* - CTi) T \cr, I = i] 



2cj coth (q) — 2 



P 



coth(Q 



coth(Q 



1 

Cj 



Ci 



where c, = ^ - , rj = jfpyr, and p is orthogonal projection onto 



Recall that 



k 2 {n-l) 



n 



n 



and that 



Using this above, 



coth(fc 2 ) 



> e 



fa 



< e 



-Kf,ne z 



k 2 



E [(*? - tr^oj - 7 = *] = - |) P + Ip. 







( 1 - ^ ) P - h(n(jj + ar?) + o-iaf + P,', 



1 

J 



(13) 



where 



E' 



g(ci) _ 1 

Ci " 



Id + 



3#(q) _ 3 
<k ~ 
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P - S(c 



/3 



Ignoring the E[ for the moment and putting the main term of (fT3l) into (fT2l) yields 

4/3 4 



EE 

* j,k^i 



- (CTj, CTfc) + II - - I PiO-fc - — (CT^. 7^ (7 fc + (7j (7^ (T k ) + 0^ ^0"- CTfc 



The first term is 



(t)|2 



i j,kj^i 



i=l 



For the second term, note that aJPiUk = Tr (akcrjrirf ), and so 

£ ofP,** = Tr (^VTrirf) = {^\nf = |a«| 2 



and thus 



n 4 fc 4 



/3 



2 i 

Similarly, for the first half of the third term, 

4/3 3 



E E »f ^ 



n 4 H - 

and the second half is the same. Finally, 

4/3 4 



4/3 4 
4/3 3 



\cr 



(i)|2 



n 4 A;| 



i j,k^i 



all together, 









+;§ee^, 

i j,k^i 

In order to apply Stein's theorem, one must recognize in this expression a deterministic 
part plus a mean zero part. With this motivation in mind, we write 



E[(W -Wf\a] 



4/3 4 



2\ (n - l) 2 k 2 2 _ 2n 2 fc 4 

" J ^ w 



n 



LL.4 



(t)|2 _ ( n jOj^f 

/3 2 



n 4 fc 



2A; 9 







I ( i ff (0 







l _ 2 ^ (n - 1) 2 A| 



(14) 
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and define a such that, to top order in n, 

4/3 4 \ n f 1 _2\ {n- l) 2 k 2 _ 2n 2 k\ 



v?k\ 



& P 2 



4 



2\a 2 



for A 



i-/V(fc 2 ) 



as above. Note that the top order in n of the expression inside the paren- 



theses can be simplified to 



2n 2 k 2 2 



0" 



1 ^ ^2 



2n 2 k 2 

P 2 

2n 2 k\ 

P 2 
2n 2 k 2 

P 



2 (coth(fc 2 ) - i 



1 - 



cosh 2 ^) 1 



coth(fc 2 ) - — 

^2 



sinh 2 ^) fcf 
1 1 
fc| sinh 2 (fc 2 )_ 



>0, 



so defining a in this way does in fact yield a strictly positive value of cr 2 which depends only 
on (3 and is independent of n. 

Then to apply Stein's abstract normal approximation theorem, it is necessary to estimate 
the expected absolute value of each of the terms above (except the 2 Act 2 part). 

For the first term, it follows as before from the LDP for \a^'\ that 



E 



a 



(j)|2 _ ( n 1) 2 ^2 



P> 



<(n- if 



e + e 



-Kp{n-l)e 2 



taking e 



log(n) 
Kp{n-1) 



shows that 



E 



(t)|2 _ i 71 ~ ^) 2 ^2 
P 2 



[n)n 



3/2 



Next, observe that 



\{<Ti,<T®) \<T®\ - (<Ti,S n ) \S n \\ < \(*i,(T®)\ + \S n \ < 2n. 



Moreover, 

4/3 4 



n 4 kl 



E 



S n ) 



n 2 k\ 



y3 3 



n 4 k 3 



E 



no h,o 
Ai 2 



P" 



< 



eg A/log(n) 



3/2 



Now, using the same definitions for g, q, r, and Pj as before, 



E 



E 



E 



^2 a J E [^^K^WJ °"fe 



3g(ci 



Ci 



p l+ 9{ci) Id 




= E 


^i 2£,(q) ^ i^^i 2 ! 


Ci 
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this explains the choice of constants in ( Fl4|) . Furthermore, 

2 



E 



2\ (n- l) 2 *; 2 



nE ( (a"i, cr 



(l) 



2^ (n- l) 2 *; 2 



+ n(n - 1)E <ai,a (1) ) - 1 



0; /? s 



2^ (n- l) 2 fc 2 



(15) 

The first term will simply be estimated by cgn 5 . For the second, first let a'- 1 ' 2 '' := Y^j>2 a j 
and observe that 



E {a u a^f (a 2 , a^f - E (<7i, a^f (a 2 , a^f 



< Cn 3 . 



Now, 



E<a 1 ,a( 1 )> 2 <a 2 ,a( 1 ' 2 )> 2 = E 



\^ T T T T 
1^ l^ a i 0\O x OjO k Oi0 2 Ot 



i,j>l k,£>2 



E 



E 



^ ^ crfE [crio-f |{a m }, n> i] ajala 2 a2(Ti 

i,j>l k,l>2 



By the LDP for er^, we can replace by 4 and |<r < - 1 ' ) | by ^ n 3^ fcz , incurring an error of size 



(n-l)fc 2 



cp ^\og{n)n 7 / 2 . At this point we are left with 



E 



1 - -x 



l?k l{a 2 ^) 2 



0) P 



We can now go back to instead of cr^ 1 ' 2 ^ (with another loss of order n 3 ), and therefore 
replace the last expression with 



f/ 2\ (n-im / 2g(c 2 )\ (2) 
IV P/ P 2 V C 2 / 



(using the expression for E[(Vj, cr®}] obtained above). One final application of the LDP for 
now means that, with loss of the same order as before, this expression is equal to 



1 - 



2\ 2 (n- Ifki 



P P 4 



Using these approximations in ( flBT) now yields 



E 



2\ (n-lfk 2 



P* 



-1 2 
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and so 



4/3 4 

rM4 



E 



e(<--»> 2 -0-I) 



(n - Ifkl 



2 



< C(j (]Q S (n)y<*n 



l/4_-5/4 



Exactly the same sorts of arguments using LDP for <jW also imply that all the errors 
from the E[ terms are smaller than those already accounted for. 
Finally, 



EIW- W\ 



1(3' 



n 



9 / 2 kl 



-E 



< 



1(3" 



The following result is now a consequence of the bounds above together with Theorem 

eh 

Theorem 11. Let W be the recentered, renormalized norm squared of the magnetization, as 
defined in ([?]) . There is a constant c@ depending only on (5 > 3 such that if Z is a centered 
Gaussian random variable with variance 



a' 



4/3 s 



for g(x) = coth(x) — then 



sup 

||fc||oo<l 
\\h'\\oc<l 



'l-Pg'{k 2 ))kl 



Eh(W)-Eh(Z) 



_k\ sinh (k 2 ) 



< 



c /3 



log(n) 



n 



1/4 



5 The critical temperature 

In this section, we prove a nonnormal limit theorem for the squared-length of the total 
magnetization at the critical temperature (3 = 3. Since the magnetization is spherically 
symmetric, this provides the limiting picture in the critical case. 
Recall that the LDP for \S n \ implies in particular that 



P 



3 iSV> 



> € 



< C exp 



n 



inf{/ 3 (x) : x > e} 



where 



J 3 (x) = ccoth(c) — 1 — log 



sinh(c) 



coth(c) — 



and c is the unique element of M + such that |x| = coth(c) — -. It is shown in the appendix 
that ^{x) is increasing, and thus inf{/ 3 (x) : x > e} = /3(e). Moreover, there is a universal 



constant r > such that for e G (0, 1) 
the subcritical case). It follows that 



— r £ (note in particular the difference from 



P 



3 ISr, 



> € 



< C exp [— cne 4 ] 
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for some constant c > 0, and so 



E 



3 \£>n\ 



n 



< e + 3C exp [— cne 4 ] . 



Choosing e 4 



lo ^ w ^ shows that E[|5 n |] < C ^ . This at least suggests what turns out 
to be the correct normalization for the magnetization; below, we prove a nonnormal limit 
theorem for W := Y^ij=i a j) > f° r a particular choice of c 3 (independent of n). Indeed, 
one can use a modification of the argument given in the beginning of section [3] to show that 



E\S n \ 



n + E 



\S n \ 



2 5^ 



2 • 3^ I Sn 



+ lower order terms 



5n 2 n 945ra 4 
from which it eventually follows that, to top order in n, there is a C3 such that 

- n 3 / 2 



E\S„ 



c 3 



E\S„ 



5rT 



The proof of the limit theorem for W as defined above is essentially via the so-called 
"density approach" to Stein's method introduced by Stein, Diaconis, Holmes and Reinert 
[TT] ; see also the recent work of Chatterjee and Shao [3], with an application to the mag- 
netization of the mean-field Ising model. The following theorem provides the framework we 
use for the approximation; the proof is given in Section 16.21 of the appendix. 

Theorem 12. Let (W,W) be an exchangeable pair of positive random variables. Suppose 
there exists a a-field U D o~(W) and k > deterministic such that 

E[W - W\&\ = 3fc(l - cW 2 ) + E 



and 

Let X have density 



E[{W - W) 2 \3] = kW + E'. 
pit) 



1 R — SL- 

-x e 2 



t > 0; 
t < 0. 

Then there are constants C±, C2, C3 depending only on c such that for all h e C 2 ( 



\Eh(W) -Eh{X)\ < 



k 



'-E\E\ 



Co(\\h\ 



\h'\ 



+ 



CM\h\ 



\h'\ 



E\E'\ 
E\W'-W\ 



Within this framework, we proceed as before: let W :— ^§72 ^2™j=i °j) ' anc ^ ma ke an 
exchangeable pair (W, W) by replacing a random spin using the Gibbs sampler. Then, as 
in the supercritical case, 



E [W - W\a] 



c 3 



n 



5/2 



E 

i=l 



2^(a u a k )-E 



2 5Z ( a i> a k) \i a j}j¥=i 



- 2 -W 

n 



2c, 



2c, 



n 



3/2 



n 



5/2 



E 

i=i 



/3|a«| 




V n 





(16) 
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again using the notation g(x) = coth(x) — |. Now, near zero, g(x) = | — ^ + 0(x 5 ). Using 
this above, 



E 



.9 



3|a«| 



(Ol 



E 

i=l 



\o 



»l 2 IcrWl 4 



n 



5n 3 



o 



(i)|6 



Note that 



- E ^ = E °i> a * - - E a *' ^ + 1 = — + L 

8=1 J,fc 



Similarly, 



hn 3 



1 " 



(7 



(i):4 



i=l 



nW 2 + AW 2 4T,ii<7i,Sn)'' 



5c 2 



5c 2 



5n 3 



2W W 1 

+ z ,/„ + 



5c3\/?2 5n 3 / 2 C3 5n 2 



Using these expressions in ( fTBT) yields 



3 / 2 5c3n 3 / 2 



where 



8PU 2 CW 3 R(a) 

+ 3 + 



n 2 bc^rv'l 2 n 2 4 n5 ^ 2 ' 

C is a universal constant, and furthermore, R(cr) < C almost surely. Note in particular the 
cancellation of the -W terms, which is the crucial difference from the subcritical case, and 
the reason that the limiting distribution of the magnetization is not Gaussian for (3 = 3. 

Recall that the LDP for ^ a>n gives us that EW, WW 2 , EW 3 < log(n), and so it follows 
that E\E\ < 



To apply Theorem [121 we take k = 2 ^ 2 and 



r>cr, 



The next step is to compute E [(W — jy) 2 |<r] . From the definition as before, 



E [{W -W) 2 \a] =^EE E l°fW ~ - ff *) Tff *k» I = i \- (17) 

i=l j,kj^i 



Now, E [(a* — CTj)((7* — o"j) T |cr, / = z] was already computed exactly in the (3 < 3 case, and 
was found to be 



E[(a* - ai)(a* - <Ti) T \a,I = i] 



2ci coth (q) — 2 



P + 



coth(cj) 1 



coth(cj) 

Ci 



Cj 



(nof + airj) + cna? , 



where c,- 



Pi is orthogonal projection onto r«, and Pj -1 is orthogonal 



projection onto the orthogonal complement of r^. 
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Now, recall that g(x) = coth(x) — ~ ~ | for small x. Using this above, 

E [<tf ~ - I = i\=\ld- C j(naJ + o.rf) + a % aj + E' t , (18) 

where 

E[ = - i) Id - (g( Ci ) - |) (no? + ffi rf) . 

Ignoring the E[ for the moment and putting the main term of ({TBI) into (TTTI) yields 

1 c- 

- (cry, o-jfc) - -^(ojriofok + ojoirjok) + ojoiofok . 
The first term is 

2 2 n 

^EEfe^Ei^i 2 

i j,k^i i=l 

The first half of the second term is 

-57 E E = "57 E <"'". = -| E i- ,0 i 2 (-"'•-■) • 

i j,kj^i i i 

and the second half is the same. Finally, 

|££^W^ = §£<^ ,,) > 2 ; 

all together, 



r 2 

n 4 4^ ^ 



e[(w'-wo>] = £x; 



iU(i)|2_l| a W|2 



,07, cr 



(i) 



n 



E 



W|2 



(i).2 



E E 



2 j,k^i 



2c| 
3n 3 



l^n| 2 - 



|5 n 



/J 



2c| 
n 5 



where the computation for E 



(auo^y 



from the supercritical case has been used. Recall 

that the main term should be §^77 and indeed it is. It is a routine collection of arguments 
very similar to those in the previous sections to show that the remaining terms are bounded 
in expectation by 



Clog(n) 

~2 
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Finally, 



E\W - W\ 3 = -^rE 



^2 Wi ~ *i7 °j) 



< 



8d 



[8E\S n \ 3 + S] < 



C log(n) 



n9/2 L I "I J — ^9/4 • 

Using these estimates and the choices of k and c in Theorem [12] yields the following 

Theorem 13. At the critical temperature (3 = 3, let W := 03 ^ , where C3 zs snc/i £/ia£ 
EFy = 1. Lei X have density 



p(t) 



l t 5 e -3 C t 2 f > o 



n 



£ < 0, 

where c = and z is a normalizing factor. Then there is a universal constant C such that 

sup \Eh(W) - Eh{X) I < ° 

||/l||oo<l, ||/l'||oc<l 
||fc"||oo<l 

6 Appendix 

6.1 Calculus of $p 

Lemma 14. If (3 < 3 ; t/ien 



inf <^ log . 

x>o I \ smh(xj 



achieved only at x = 0. 



+ x ( coth(x) 



x 



P ( , / \ 1 
-fcoth(x)-- 



0. 



Proof. We show first that the expression to be minimized is increasing. Differentiating the 
expression in question yields 



sinh(x) 



.r 



1 



sinh(x) — x cosh(x) 



sinh (x) 



coth(x) 



x 



+ 


coth(x) 

X 


+ X 


' 1 


1 




_x 2 sinh 2 (x) _ 





x 2 sinh (x) 



x 2 sinh (x) 



x — (3 [ coth(x) 

x 



Expanding sinh(x) in a Taylor series, 

sinh 2 fx) = 



x 



£1 x 2n+l 



^ (2n + I) 

?1=1 v ' 
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> x 2 . 



The problem is therefore reduced to showing that 



x — /3 ( coth(x) ) > 0, 



or alternatively, 



x 



xcoth(x) — 1 

Clearly showing this for /3 = 3 suffices to prove the lemma. Rearranging yet again, this is 
equivalent to showing that 

... 1 x 

coth(x) < — . 

x 3 

Expanding coth(x) in terms of e 2x and rearranging terms, this is furthermore equivalent to 
showing that 



1 — x 



x 



e 2x > 1 + x 



x- 



Expanding e in a Taylor series, the left-hand side of the inequality above is given by 



1 + x + — + 22 x n 



n=3 



2" 



in— 1 



m-2 



+ 



(n-1)! 3(n-2)! 

„2 



1 2 n ~ 2 x n 



n=3 



3n\ 



7 

n 

2 



It is easy to see that the n = 3 and n = 4 terms in the power series above are zero and that 
the rest are all positive, thus completing the proof that the expression to be minimized is 



increasing. Moreover, recall that lim^o coth(x) — - = and lim 



lim < log 
no 1 



sinh(x) 



x ( coth(x) — 



x 



i 

2 



sinhO) 
2' 



coth(x) 



X 



1, so 



0. 



Lemma 15. For f3 > 3, there is a unique value of x G (0, oo) which minimizes 

2 



log 



X 



sinh(x) 



x ( coth(x) — 



x 



2 



coth(x) 



X 



over [0, oo). 



□ 



Proof. From the previous proof, we have that the derivative of the expression to be minimized 
is 

(3 - j3)x 



x 2 sinh 2 (a;) 



x — (3 \ coth(rc) 



x 



9 



for x near zero. For > 3, it follows that x = is a local maximum of the expression on 
[0, oo). As x tends to infinity, the expression to be minimized is asymptotic to log(x), and 
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there is therefore at least one interior minimum. Since -A . A, , > 0, it must be the case 

that at this interior minimum, 

x-P ^coth(x) - = 0; 

that is, 





coth(x) — - 

In fact, the function g(x) = coth ^_ i is strictly increasing on (0, oo), and this equation thus 
uniquely determines x in terms of 0. First observe that 

coth(x) — - + — wr 

q (x) = q , 

(coth(x)-i) 2 

and it thus suffices to show that 

2 ^ jv_ 
x sinh 2 (x) 

for x > 0; multiplying through by xsinh 2 (a;), one could equivalently show that 

xsmh(x) cosh(x) + x 2 — 2 sinh 2 (x) > 0. 

Using the identities sinh(x) cosh (a;) = smh ( 2x } and sinh 2 (x) = co&h (^)~ 1 ; this is equivalent to 
showing that 

x ■ - * ■ 2 



coth(x) - - + i 2 > 



sinh(2x) + x — cosh(2a;) + 1 > 0. 
Expanding the left-hand side in Taylor series yields 

r o2re-l o2n 1 00 x 2n22n-2 



n=2 



Vx 2n ' — - = V — — fen -41 

^ 2(2n-l)! (2n)! ^ 1 



(2n) 



all of whose terms are indeed positive. 



□ 



Lemma 16. Let k?, denote the unique value of x G (0, oo) with 

x - (3 ^coth(x) - = 0. 

Then 



{k 2 2 sinh 2 (fc 2 )J <L 

In particular, if 

$p{x) := log ( , f ) + x ( coth(x) - - J - f ( coth(x) - - J 
\sitin(i) / \ x J 2 \ x J 

then <5>'p{k 2 ) = and $jg(Jfc 2 ) > 0. 

30 



Proof. Let 



fix) :— x — j3 ( coth(x) 

x 



so that f(k 2 ) = 0; as was shown in the previous proof, this uniquely defines k 2 in terms of /3. 
Moreover, it was also shown that \im x ^ f(x) = 0, lim^oo f(x) = oo, and lim x ^ f'( x ) < 0. 
That is, / is initially decreasing from 0, and then becomes increasing eventually, crossing 
the x-axis exactly once in (0, oo). It must therefore be that there is an x < k 2 such that 
f'(x) = 0. Now, 

' 1 1 



x- 



sinh (x) 



In fact, g(x) :- 



s inh 2 (a) * s decreasing on (0, oo): observe first that 



g'(x) = 2coth(x) csch 2 (x) — 



and so the claim is true if coth(x) csch 2 (x) < x 3 . By the definitions of the hyperbolic 
trigonometric functions, this is equivalent to 



4x 3 (e 4x + e 2x ) < (e 2x 



If 



Expanding in power series, this is equivalent to 



/ 



E 

n=3 



4(4 



n— 3 



in- 3> 



(n-3)! 



-x' 1 < 



E 



,n=l 



(2x) n 



in 



E 

n=3 



2" 



5^ 

i,j,k>l J 
\i-{-j+k=n 



X 



Letting a :- 



1, (3 := j — 1, and 7 = k — 1, the coefficient of x n on the right-hand side is 



2" 



(n-3)! 



E 



a,/3,7>0 
a+/3+7=n— 3 



n — 3 

«,/3,7 



jn— 3 



(n-3)! 



It is now easy to see that the coefficient of x n on the right-hand side is smaller than the one 



on the left for each n > 3, and so it is in fact true that g(x) :- 



i 



sinh (x) 



is decreasing on 



(0, oo). It follows that f'(x) is increasing on (0, oo), and so the previously identified x < k 2 
such that f'(x) = is in fact the only zero of /', and f'{k 2 ) > 0. 

Now, recall from the proof of the previous lemma that 



sinh (x) 



x — f3 ( coth(x) 



The value k 2 was in fact determined by the fact that $^(^2) = 0. Moreover, 



.r- 



sinh (x) 



P 



.r- 



sinh (x) 



+ 



2 coth(x) csch 2 (x) — 



x- 



x — f3 ( coth(x) — 



x 
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and so 



^2 



sinh (&2 



sinh (/c2 



> 



□ 



6.2 Proof of Theorem H21 

This section is devoted to the proof of the abstract approximation theorem used in Section 
[5j The following theorem is the starting point. 

Theorem 17. Let Y be a positive random variable. Then Y has density 



p(t) 



l t 5 e -3ct* t>Q 







t < 0. 



(19) 



if and only if 

E[Yf(Y) + 6(1 -cY 2 )f(Y)] =0 
for all f G (^((O, oo)) such that J Q °° f (t)t 5 e" 3rf2 dt < oo. 

Proof. If Y has the density above, then the fact that Y satisfies (fl9|) is just integration by 
parts. 

For the reverse implication, one must solve the so-called Stein equation; i.e., given h : 
(0, oo) -> M, find f = fh such that 

tf'it) + 6(1 - ct 2 )f{t) = h{t) - Eh(X), 

where X has the density above. The solution / is given by 

1 * 



f(t) 



tp(t) jo 



[h(s) -Eh(X)]p(s)ds 
[h(s)-Eh(X)]p(s)ds, 



where p(s) :- 



s b c -3cs 2 



Observe that 



so that 



j t [tf(t)p(t)} = [f(t)+tf(t)}p(t)+tf(t)p'(t) = [h(t)-Eh(X)]p(t), 

h(t) - Eh(X) = f(t) + tf(t) + tfit J^ {t) = 6(1 - ct 2 )f(t) + tf{t). 
Here we have made use of the fact, frequently used below, that 

P(t) 

It is shown in the next lemma that / and /' are both bounded, and so if Y satisfies (1T9~ 
then if h is given and / solves the Stein equation, 

Eh(Y) - Eh(X) = E [Yf'(Y) + 6(1 - cY 2 )f(Y)] = 0, 



and so Y = X. 



□ 
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The next lemma gives the required boundedness properties of fh- 
Lemma 18. Suppose f = fh is as defined above. Then 

(a) UMloo < 5||/i||oc- 

(b) Halloo <42V^||/i|U + 3||/ i / || 00 . 

( c ) ll/hlloo — Ci 1 1 ^ 1 1 oo + C2 1 1 ^ 7 1 1 00 + C*3 1 1 ^ // 1 1 00 , where C\, C%, C3 are constants depending only 



on c. 



Proof. (a) By the first expression for fh, if t < t := \/ q, then 



t 5 e 5 / 2 \\h\ 



< 



< 5 



By the second expression for fh, 



\f(t)\ < 



2\\h\\ 00 F[X>t] 
tp{t) 



It is easy to show directly that F[X > t] < ^ (l + 3^2 + oJt?) > an d so 



2 \ 84 1 



1 + -I < -"--ii^ 

6ct 2 V 3ct 2 9c 2 t 4 y ~ 125 

for t >t a . This completes the proof. 

(b) Recall that, because / solves the Stein equation, 

tf\t) = 6f{t){ct 2 - 1) + h{t) - Eh{X). 

For t <t Q , observe that 

h(t) - Eh(X) - 6f{t) = h[t) - Eh{X) ^- f [h(s) - Eh(X)]p(s)ds 

1 f [h{t) - Eh{X)]^^- - [h{s) - Eh{X)]p{s) )ds. 



tp(t) J 



t 5 



Now, 



< 



2\\h\ 



tpit) Jo 



s 5 p(t) 



t 5 p(s) 



P( s ) ds = , ,J° 

tp{t) JO 



1-e 



3c(s 2 -t 2 ) 



p(s)ds 



< 211/iHooC^, 
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making use of the fact that p(s) < p(t) in this range. Also, 
1 " 



tp(t) 



[h(t) - Eh(X)] - [h(s) - Eh(X)]Jp{s)ds 

Wh'W 

II II oo 



< 



f (t- s)p{s)ds < 
Jo 



\h'\\ t 



so that for t <t Q , 



and thus 



h{t) - Eh(X) - 6/(t) < 1211/iHooCto + 3\\h'\ 



\f(t)\ < Qct a \ u 
< A2ct n 



o || I oo 



12||/i||c* + 3||/i'| 
F3||/i'IL. 



If t > t Q , then 



\f'(t)\ <6ct|/(t)| + ^L /l|oo + 211 ' 



t, 



By the second expression for /, 



3t 



2 2 \ 
+ 3ct 2 + 9c 2 t 4 y 



making use again of the estimate F[X > t] < ^ (l + + g^i) . Taking t a — y ^ 
as before and making some trivial simplifying estimates completes the proof of this 
part. 



(c) Differentiating both sides of the Stein equation gives that 

tf"(t) = Uctfty) + (Get 2 - 7)f(t) + h'(t). 



If t < t , then 1 12c/ (t) | < ScH^U and by the Stein equation, \6ctf'(t)\ < 3601]/]]^ + 
12c|| || oo < 192c|| || oo. Also by the Stein equation, 



(20) 



s ( 1\ h(t)-Eh(X) 
f'(t) = 6/(t) let --)+ U - 1 ; . 



The first term can be absorbed into the existing bound, so that 



\f"(t)\<C\\h\l 



1 



h'(t) 



42/(0 - 7[h(t) - E/ipT)] 
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(From now on we will not bother to keep track of specific constants and their depen- 
dence on c.) Now, 



h'(t) 



7[h(t)-Eh(X)-6f(t)} 



t 



h'(t) - 



12 



t 2 p(t) 
42 rt 



o 



\h{t) - Eh{X)]^jQ - [h(s) - Eh(X)]p(s) 



t 2 p(t) Jo 
42 



[h(t) - Eh(X)}e- 3ct " - [h{s) - Eh{X)}e 



-3cs 2 



2\ S 



t 2 p(t) Jo 

Let H(t) := [h(t) - Eh(X)]e~ 3ct2 . Then 

H'{t) = [h'(t) - Qct[h(t) - Eh(X)]]e 
and so the equation above becomes 
7[h(t) - Eh(X) - 6/(t)] 



(t-s)h'(t)e- 3ct )-ds. 



ds 

5 



h'(t) 



t 



42 



pt 5 

J (h(s) - H(t) - (s - t)H'(t) + Qct(t - s) [h{t) - E/i(X)]e~ 3c<2 ) S —ds. 



t 2 p(t) 
It follows that 

7[h(t) - Eh(X) - Qf(t)} 



h'{t) 
Now, 



t 



< 



12 



t 2 p(t) 



sup \H"(s) 

s&(0,t) 



ct 



12 z J z 



— [h{t)-Eh{X)]e 



H"(s) = [h"(s) - 12csh'{s) - 6c(6cs 2 - l)[h{s) - Eh{X)}] 

so 

sup \H"(s)\ < \\h"\\ 0O + 12ct o \\h'\\ oo + 24c\\h\\ oo . 

sG(0,t) 

All together, this gives that for t < t Q , 

\f"(t)\<C 1 \\h\\ 00 + C 2 \\h'\\ 00 +\\h"\\ 00 , 



-3cs 2 



where C±, C*2, C3 are constants depending only on c. 

For t > t Q , it follows from ( 120]) and estimates already carried out that 



If Wl < 12c 



< Ca 



6ct+l)\f'(t)\ 



\h'\ 



4 00 



+ C 5 \\h'\\ 00 + ct\f(t)\. 
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By the Stein equation, 

ctf{t) = Qcf(t)(ct 2 - 1) + ch{t) - cEh(X), 

and 



pit) 3 V 3rf2 9c2 * 4 7 ' 

and so finally 

|/"(*)| < C 6 |N|oo + GiU/l'lloo. 

□ 

With the boundedness lemma, we can now give the proof of Theorem[12j For convenience, 
we recall the statement. 

Theorem 1121 Let (W, W) be an exchangeable pair of positive random variables. Suppose 
there exists a a-field H D o~(W) and k > deterministic such that 

E[W - W\3r] = 3k(l - cW 2 ) + E 

and 

E[{W - W) 2 \3] = kW + E'. 

Let X have density 

p(t) = {* 

[0 t<0. 

Then there are constants Ci,C 2 , C 3 depending only on c such that for all h e C 2 (R), 
\Eh(W) - Eh(X)\ < ^^E\E\ + ^(IN|oo + ||/^|oo) ^ E| ^! 

Proof. Given h, let / be the solution to the Stein equation described above. Then by 
exchangeability and the conditions on (W, W), 

= E[(W'-W)(f(W') + f(W))] 
= E[{W - W)(f(W) - f(W)) + 2(W - W)f{W)} 
= E[(W' - W) 2 f (W) + E" + 6Jfc(l - cW 2 )f(W) + 2Ef(W)\ 
= E[kWf(W) + E'f'(W) + E" + 6Jfe(l - cW 2 )f(W) + 2Ef(W)]. 



Then 



and 



E[Wf'(W) + 6(1 - cW 2 )f(W)] = ~E[E'f'(W) + 2Ef(W) + E"\ 

\e"\ < M3^\(w>- W )\\ 



The result is thus immediate from Lemma |_ 

□ 
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